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Abstract. Consider the ensemble of Real Symmetric Toeplitz Ma- 
trices, each entry iidrv from a fixed probability distribution p of mean 
0, variance 1, and finite higher moments. The limiting spectral mea- 
sure (the density of normalized eigenvalues) converges weakly to a new 
universal distribution with unbounded support, independent of p. This 
distribution's moments are almost those of the Gaussian's; the deficit 
may be interpreted in terms of Diophantine obstructions. With a lit- 
tle more work, we obtain almost sure convergence. An investigation 
of spacings between adjacent normalized eigenvalues looks Poissonian, 
and not GOE. 
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1. Introduction 

One of the central problems in Random Matrix Theory is as follows: con- 
sider some ensemble of matrices A with probabilities p{A). As — oo, 
what can one say about the density of normalized eigenvalues? For Real 
Symmetric matrices, where the entries are iidrv from suitably restricted 
probability distributions, the limiting distribution is the semi-circle. Note 
this ensemble has independent parameters (aij,i < j). For matrix 

ensembles with fewer degrees of freedom, different limiting distributions 
arise (for example, McKay |McK| proved (i-regular graphs are given by 
Kesten's Measure). By examining ensembles with fewer than N"^ degrees 
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of freedom, one has the exciting potential of seeing new, universal distribu- 
tions. In this paper we investigate Symmetric Toeplitz matrices. 

Definition 1.1. A Toeplitz matrix is a matrix of the form 
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• b^-i 
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We investigate symmetric Toeplitz matrices whose entries are chosen ac- 
cording to some distribution p with mean 0, variance 1, and finite higher 
moments. The probability density of a given matrix is Y{^=o Pih)- 

By looking at Trace(A^) = Ylii (^)' we see that the eigenvalues of A 
are of order V^. As the main diagonal is constant, all 60 does is shift each 
eigenvalue. Therefore, it is sufficient to consider the case where the main 
diagonal vanishes. 

To each Toeplitz matrix, we may attach a spacing measure by placing a 
point mass of size at each normalized eigenvalue: 



llA,N{.x)dx = 




The A;*'* moment of ha,n{x) is 

1 ^ 

Let Mk(N) be the average of Mk(A, N) over the ensemble, with each A 
weighted by its density. We show that Mk{N) converges to the moments of 
a new universal distribution, independent of p. The new distribution looks 
Gaussian, and numerical simulations and heuristic sketches at first seemed 
to support such a conjecture. A more detailed analysis, however, reveals that 
while Mk{N) agrees with the Gaussian moments for odd k and A; = 0, 2, 
the other even moments are less than the Gaussian. 

We now sketch the proof. By the Trace Lemma, 

N 

^Af(A) = Trace(A'=) = ^ «n,i2«i2,i3 • • • (4) 
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As our Toeplitz matrices are constant along diagonals, depending only 
on\im — in\, we have 

Mk{N) = — E{b\i,^i^\b\i^_i,\---b\i^_i,\), (5) 

l<ii,- ,ik<N 

where by E(- ■ ■ ) we mean averaging over the Toeplitz ensemble, with 
each matrix A weighted by its probability of occurring, and the bj are iidrv 
drawn fromp(x). 

We then show that as — * oo, the above sums vanish for k odd, and 
converge independent of p for k even to numbers bounded by the mo- 
ments of the Gaussian. By showing E[\Mk{A, N) - Mfc(iV)|™] is small for 
m = 2 (m = 4), we obtain weak (almost sure) convergence. 

Remark 1.2. This problem was first posed by Bai BBail . where he also 
asked similar questions about Hankel and Markov matrices. Almost surely 
the methods of this paper would be applicable to these cases. Bose and 
Bryc-Dembo-Jiang have independently observed that the limiting distribu- 
tion is not Gaussian. Using a more probabilistic formulation, [BDJ| have 
calculated the moments using uniform variables and interpreting results 
as volumes of solids related to Eulerian numbers. We have independently 
found the same numbers, but through Diophantine analysis, which allows 
us to interpret the deviations from the Gaussian in terms of Diophantine 
obstructions, and estimate the rate of convergence. 

2. DETERMINATION OF THE MOMENTS 

2.1. k = 0,2 and k odd. For all N, Mo{A, N) = Mo{N) = 1. For k = 2, 
we have 



1<«1.«2<V l<ii,«2<V 

As we have drawn the 6s from a variance one distribution, the expected 
value above is 1 ifii ^ i2 and otherwise. Thus,M2(A^) = ^^^'^ ~ 
Note there are two degrees of freedom. We can choose 6|ji-j2| to be on any 
diagonal. Once we have specified the diagonal, we can then choose ii freely, 
which now determines Z2- 

For k odd, we must have at least one bj occurring to an odd power. If one 
occurs to the first power, as the expected value of a product of independent 
variables is the product of the expected values, these terms contribute zero. 
Thus, the only contributions to an odd moment come when each bj in the 
expansion occurs at least twice, and at least one occurs three times. Hence, 
if A; = 2m + 1, we see we have at most m + 1 degrees of freedom, this 
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coming from the case 6i 6l • • • 6? . There are m different factors of h, and 
then we can choose any one subscript. Once we have specified a subscript 
and which diagonals we are on, the remaining subscripts are determined. 
As all moments are finite, we find 

M2m^l{N) «^ .Li N^^^ (7) 



2.2. Bounds for the Even Moments. We proceed in stages in calculating 

M2k{N), 2k > 4. First, we bound A'hkiN) by 2'' ■ 2^ ■ {2k - 1)!!, where 
{2k — 1)!! is the 2k^'' moment of the Gaussian. We then show that each 
factor of 2*^ can be removed, and then show a strict inequality holds. 

^2/c(iV) = ^ E(6|,,_,,|6|,,_,3|---6|,,,_,,l). (8) 

l<ii,--- ,i2k<N 

If any bj occurs to the first power, its expected value is zero and there 
is no contribution. Thus, the bjs must be matched at least in pairs. If any 
bj occurs to the third or higher power, there are less than k + 1 degrees of 
freedom, and there will be no contribution in the limit. 

The bjS are matched in pairs, say = Let Xm = \im - 

im+i \ — \in — in+i\- There are two possibilities: 



There are k such pairs, thus we have 2^^ choices of sign. Further, there 
are {2k — 1)!! ways to pair off 2k numbers into groups of two. 

Fix a choice of sign and a pairing. Once we specify Xi, . . . , and any 
one index, say ii, all the other indices are almost determined (if the choices 
are consistent). There is one remaining freedom. After we've chosen which 
differences to match and the values of these differences and the choice of 
signs, for each time when there is a negative sign, there is one additional 
choice: does the positive or negative difference occur first? Thus, after we 
specify for each pair whether the positive or negative difference occurs first, 
then all the indices are determined. 

Therefore, there are N'^^'^ degrees of freedom. If all the xjs are distinct, 
we have the expected value of the second moment of p, k times. These 
contribute at most 

_L_ . 2^= . 2'=(2A; - l)m''+^ = 2'=(2A; - 1)!!. (10) 

If some of the xjs are equal, we have fewer than k+1 degrees of freedom. 
We now have the expected value of a product of moments of p, which is 
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finite and independent of N. These terms will not contribute in the limit. 
Therefore 



lim M2k{N) < 2^ -2^(2^-1)!!. (11) 

We now remove the factor of 2'' coming from the choice of signs. Con- 
sider a pairing of the bjS. We claim the only term which contributes in the 
limit is when all signs are negative. 

Let Zi, . . . , Xfc be the values of the \ij — ij+i|s, and let ei, . . . , be the 
choices of sign (see Equation|51). Define xi = ?i— ^2, ^2 = ^2—^3, • • • , 5?2fc = 
i2k — ii- Note exactly one Xj is Xj and exactly one is ejXj. We have 

12 = ii - xi 

13 = ii-xi~ X2 



h = h-xi X2k- (12) 

Therefore 



k 

Xi-\ \-X2k = '^{l + ej)xj = 0. (13) 

i=i 

If any ej = 1, then the Xj are not linearly independent, and we have 
fewer than A; + 1 degrees of freedom; these terms will not contribute in 
the limit. Thus, the only valid assignment is to have all the signs negative. 
There are now 2'^ possible choices of order (whether the negative or positive 
difference occurs first), giving 2^ ■ A^^+^. We eliminate 2'' by changing our 
viewpoint. 

We have k + 1 degrees of freedom. We match our differences into k pairs. 
Choose ii and 22- We now look at the freedom to choose the remaining in- 
dices ij. Once ii and 22 are specified, we have ii — i2, and a later difference 
must be the negative of that. If ^2 — ^3 is matched with ii — Z2, then is 
uniquely determined (because it must give the opposite of the earlier differ- 
ence). If not, ^3 is a new variable. Now look at i^. If — is matched 
with an earlier difference, then the sign of its difference is known, and ^4 is 
uniquely determined; if this difference belongs to a new pair not previously 
encountered, than is a new variable and free. Proceeding in this way, we 
note that if we encounter i„ such that — i„ is paired with a previous dif- 
ference, the sign of its difference is specified, and i„ is uniquely determined; 
otherwise, if this is a difference of a new pair, in is a free variable, with at 
most choices. Thus we see there are at most A^'^^^ choices (note not all 
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choices will work, as for example the final difference i2n — h is determined 
before we get there, because of earlier choices). 

More explicitly, having k -\- 1 degrees of freedom does not imply each 
term contributes fully - we will see there are Diophantine obstructions 
which bound the moments away from the Gaussian's. However, each pair- 
ing and choice of sign contributes at most N^'^'^, and we have shown 

M^kiN) < (2A;-l)!! + 0fc(^^^. (14) 

2.3. The Fourth Moment. The fourth moment calculation highlights the 
Diophantine obstructions encountered, which bound the moments away 
from the Gaussian. 

= ^ XI ^{h^-i2\h^-iz\hz-iAhA-h\) (15) 

l<ii,i2,i3,j4<iV 

Let Xj = \ij — ij+i I . If any b^^ occurs to the first power, its expected value 
is zero. Thus, either the Xj are matched in pairs (with different values), or 
all four are equal (in which case they are still matched in pairs). There are 
3 possible matchings; however, by symmetry (simply relabel), we see the 
contribution from xi — X2, X3 — X4 is the same as the contribution from 

Xi = X4, X2 = x^. 

If xi = X2, X3 = Xi, we have 

H-k = -(k-h) and ^3-^4 = -(«4-«i)- (16) 

Thus, ii — is and ^2 and ^4 are arbitrary. Using these three variables as 

our independent degrees of freedom, we see there are A^^ such quadruples. 
Almost all of these will have xi 7^ x^, and contribute E(6^^6^.J = 1. Given 
ii and ^2. — 1 choices of ^4 yield Xi 7^ X3, and one choice yields the two 
equal. Letting p4 denote the fourth moment of p, we see this case contributes 

^,{NHN-l).l^N^.p,) = 1-1 + ^ = 1 + 0(1). (17) 

The other possibility is for xi = x^ and X2 = X4. Non-adjacent pairing 
is what leads to Diophantine obstructions, which decreases the contribution 
to the moment. Now we have 

H-k = -(is-id and i2 - is = -(U-h)- (18) 
This yields 

ii ^ i2 + k-i3, ii,i2,i3,k & {i, ■ ■ ■ ,N}. (19) 
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The fact that each ij e {1, • • • , N} is what leads to the Diophantine ob- 
structions. In the first case, we saw we had three independent variables, 
and A^^ + O(iV^) choices that were mutually consistent. Now, it is possible 
for choices of i2, is and to lead to impossible values for ii. For exam- 
ple, if ^2,^4 > ^ and ia < y, we see ii > N. Thus, there are at most 
(1 — ^)A^^ valid choices. This is enough to show the Gaussian moment is 
strictly greater; later we will see that if there is one moment less than the 
Gaussian, all larger even moments are also smaller. 

The following lemma shows this case contributes | to the fourth moment. 

Lemma 2.1. Let /at = {1, ... , A^}. Then y,zelN-'i-<x+y — z< 
N} = |Ar3 + 1 AT. 

Proof. Say X + y = S e {2, . . . , 2N}. ¥or 2 < S < N, there are - 1 
choices of z, and for S > + 1, there are 2N — S* + 1. Similarly, the 
number of a;,!/ G In withx + y = SisS-lifS < N+1 and2A^-5' + 1 
otherwise. The number of triples is 



N 2N 
5=2 S=N+1 



-A^^ + -N. 
3 3 



(20) 



□ 



Collecting all the pieces, we have shown 
Theorem 2.2 (Fourth Moment). Let p4 be the fourth moment of p. Then 

Mm -4^'-^^^,- (21) 

2.4. Sixth and Eight Moments. Any even moment can be explicitly deter- 
mined by brute-force calculation, though deriving exact formulas as — > oo 
requires handling involved combinatorics. To calculate the higher moments, 
consider 2k points on the unit circle, and look at how many different shapes 
we get when we match in pairs. We find Mq{N) = 11 (compared to the 
Gaussian's 15), and M8{N) = 64^ (compared to the Gaussian's 105). For 
the sixth moment, there are five different configurations: 
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These occur 2, 6, 3, 3 and 1 time, contributing 1, |, 1, |, and | (respec- 
tively); these correspond to the 15 = (6 — 1)!! pairings. For the eight 
moment, the smallest contribution is |, coming from the matching xi — xs, 
X2 = .T4, X5 = x-j, xq = xs- It seems the more crossings (in some sense), 
the greater the Diophantine obstructions and the smaller the contribution. 

3. Upper Bounds of High Moments 

3.1. Weak Upper Bound of High Moments. 

Lemma 3.1. For2k > 4, limTv^oo M2fc(A^) < {2k - 1)!!. 

Proof. Once we find a pairing that contributes less than 1 for some moment, 
we note that it will lift to pairings for higher moments that will also con- 
tribute less than 1. Say we have such a pairing on 6|ii-i2| • • ■ &|i2feo-ji| giving 
less than 1. We extend this to a pairing on 2k > 2ko as follows. We now 
have 



^\h-i2\ ' ' ■ ^|i2fco-l-i2fcol^N2fco-*2fco+ll^K2fco+l-»2fco+2| ' ' ' ^1*2^-1 -i2fc I ^1*2*: -«i I ' 

(22) 

In groups of two, pair adjacent neighbors from b^i^^^^^^i^^^^^i to bii^^_^_i^^i. 
This implies i2ko — ^2feo+2 = • • • = i2k- Thus, looking at the first 2ko — 1 
and the last factor gives 

b\ii-i2\ ' ' ' b\i2kg-i-i2ko\b\i2k-H\ ~ ^|n-i2| ' ' ' ^|j2fco-l-i2fco |^N2fco-«<l ' ^^^^ 

Now pair these as in the pairing which gave less than 1, and we see this 
pairing will contribute less than 1 as well. □ 

3.2. Strong Upper Bound of High Moments. In general, the further away 
one moment is from the Gaussian, the more one can say about higher mo- 
ments. While we do not have exact asymptotics, one can show 

Theorem 3.2. lim„_,oo (aSyn = 0. 
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Proof. We will show that for any positive integer c, for k sufficiently large, 
as A?^ — > oo the moment is bounded by (|)'^(2A; — 1)!!. We have shown 
that we may take as independent variables the k values of the subscripts 
of the fejS . . . , Xk) and any index. The goal is to show that almost all 
of the pairings, for k large, have at least c Diophantine obstructions (of 
the type encountered in the fourth moment). If there were no obstructions, 
these terms would contribute N^; the obstructions reduce the contribution 
to lN\ 

We strategically replace our set of independent variables id,xi, . . . ,Xk 
with new variables which exhibit the obstructions. We give full details on 
dealing with one obstruction, and sketch how to add more. For simplicity, 

instead of referring to ii, ^2, • • • , i2k, we use i, j, k,. . . andp, g, r, Thus, 

in the trace expansion we have terms like ai^i^ — 6|ii-i2|; we refer to this 
point by i or by ij. 




Say we pair b\i^j\ with h\q-r\. Let xi = i — j = —{q — r). If we knew 
i = j + r — q, with j, r and q independent free variables, then our earlier 
results show there are only ^N^, not N^, solutions. Unfortunately, j, r and 
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q need not be independent; however, for almost all of the (2/c — 1 ) ! ! pairings, 
they will be. 

Create a buffer zone around ij and qr of two vertices on each side, and 
assume that neither buffer zone intersects. Given ij, there are {2k — 1) — 8 
possible choices to place qr. Now connect the neighbors of ij and qr such 
that nothing is connected within one vertex of another. There will be {2k — 
0{l))-{2k-0{l))- {2k^O{l))-{2k-0{l)) such pairings. Note that, as we 
start placing some of these connections, some vertices become unavailable. 
For example, say there is exactly one vertex between the buffer of ij and 
the buffer of qr. This vertex is not available for use, for if we were to place 
another vertex there, the indices it gives would not be independent. The 
same would be true if there were just two vertices between the two buffers, 
and so on. In each case, however, we only lose 0(1) vertices. As all these 
pairings are separated, we may label their differences by 2:2, 2:3, and x^, 
independent free variables. 

The point is that the separation allows us to replace some the indepen- 
dent variables x^ with j, r and q. Note that each index appears in exactly 
two vertices on the circle, and they are adjacent. Thus, these are the only 
occurrences of i,j, q, r and we may replace x^ with q, X4 with r, and Xi 
with j. We now have the desired situation: i — j + r — q, with all three on 
the left independent free variables. 

There are {2k — 11)!! ways to pair the remaining vertices. For those pairs 
that have j, q, r independent, the contribution is at most |A^^ • iV*^+^~^; for 
the others, we bound the contribution by N'^'^^. Hence 



M2k{N) < 



{2kf{2k - ll)!!-A^*^+i + 0{k^) ■ {2k - 11)!! • N^+^ 
3 



< |p,-l)!! + o((^). (24) 



Therefore, 



^-W < l + ori l. (25) 



(2A;-1)!! - 3 \k ^ 
There are two ways to handle the general case with c Diophantine ob- 
structions. One may start with enormous buffer zones around the initial 
pairs. As the construction progresses, we open up more and more portions 
of the parts of the buffer zones not immediately near the vertices. This keeps 
all but 0(1) vertices available for use. Alternatively, along the lines of the 
first construction, we can just note that by the end of stage c, Oc(l) vertices 
were unusable. We will still have the correct power of 2k, with a correction 
term smaller by a factor of | . □ 
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4. Lower Bound of High Moments 

4.1. Preliminaries. By obtaining a sufficiently large lower bound for the 
even moments, we show the limiting distribution has unbounded support. In 
particular, we must find a lower bound L2k such that limfc^oo = oo. 

We know the moments are bounded by those of the Gaussian, {2k — 1)!!; 
the limiting value of the 2A;-th root of the Gaussian (by Stirling's Formula) 
is |. We will show v^X^ ^ k^^" in the limit. 

The construction is as follows: in studying the 2k-th moment, we are led 
to sums of the form 



1 



E 



E 



N N 



12, 13 "'«2fc,»l 



.«1 = 1 «2fc = l 

■ N N 



.n=i «2fe=i 



(26) 



If any h\i„~i^^^\ occurs only once, as it is drawn from a mean zero distri- 
bution, there is no contribution to the expected value. Thus, the 2k numbers 
(the 6s) are matched in at least pairs, and, to obtain a lower bound, it is 
sufficient to consider the case where the differences are matched in k pairs. 
Let these positive differences (of |z„ — i„+i|) be a;i, . . . , Xk- 

In Section IZ2l we showed the matchings must occur with negative signs. 



Thus, if \in 



, then iir. 



We let 



In 



j + l- 



Thus, for any Xj, there is a unique ji such that Xj^ 



X 



3' 



and a unique j2 such that We call the first set of differences 

positive, and the other set negative; we often denote these by Xp and x„, and 
note that we have k of each. 

We have k + 1 degrees of freedom. We may take these as the k differences 
Xk, and then any index, say ii. We have the relations 



12 = ii — Xi 

is = ii -xi - X2 

i2k = ii-xi X2k- (27) 

Once we specify ii and the differences xi through X2k, all the indices are 
determined. If everything is matched in pairs and each ij E {1, . . . , N}, 
then we have a valid configuration, which will contribute +1 to the 2A;-th 
moment. The reason it contributes +1 is because, as everything is matched 
in pairs, we have the expected value of the second moment of p{x),k times. 
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Thus, we need to show the number of valid configurations is sufficiently 
large. The problem is that, in Equation ITTl each index ij E {1, . . . , A^}; 
however, it is possible that a running sum ii — xi — ■ ■ ■ — is not in this 
range for some m. We will show that we are often able to keep all these 
running sums in the desired range. 



4.2. Construction. Let « e (|, 1). Let = {1, . . . , A}, where A = 
Choose each difference Xj from I a', there are A'^ ways to do this. In the 
end, we want to study A;-tuples such that no value is chosen twice. Note 
such tuples are lower order, namely there are at most (2)^4'^^^ such tuples. 
This is 0(A^^^^). As ii takes on at most values (not all values will in 
general lead to valid configurations), we see tuples with repeated values 
occur at most 0{N'') times; as we divide by A^^+^, these terms will not 
contribute for fixed k as N ^ 00. Thus, with probability one (as N — > 00), 
we may assume the k values Xj are distinct. 

Let us consider k distinct positive numbers (the XjS) drawn from Ia, giv- 
ing rise to k positive differences XpS and k negative differences x^s. Let 
us make half of the numbers xi, . . . ,Xk positive (arising from the XpS), and 
half of these numbers negative (arising from the x„s). Call this the first 
block (of differences). 

Then, in the differences Xk+i, • • • , (the second block), we have the re- 
maining differences. Note every positive (negative) difference in xi, . . . , x^ 
is paired with a negative (positive) difference in Xk+i, ■ ■ ■ , X2k- Note we 
have not specified the order of the differences, just how many positive (neg- 
ative) are in the first block / second block. 

Note two different /c-tuples of differences xj cannot give rise to the same 
configuration (if we assume the differences are distinct). This trivially 
follows from the fact that the differences specify which diagonal of the 
Toeplitz matrix the cti^j^+iS are on; if we have different tuples, there is 
at least one diagonal with an entry on one but not on the other. 

Let us assume we have chosen the order of the differences in the first 
block, xi, . . . , Xfe. We look at a subset of possible ways to match these with 
differences in the second block. In the second block, there are | positive 
(negative) differences Xp (x„). There are (|) ! ways to choose the relative or- 
der of the positive (negative) differences. Note we are not giving a complete 
ordering of the differences in the second block. There are k\ > (|)!^ ways 
to completely order. We are merely specifying the relative order among 
the positive (negative) elements, and not specifying how the positive and 
negative differences are interspersed. 
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Thus, the number of matchings, each of which contribute 1, obtainable 
by this method is at most 

N ■{A'' -0{A^-^))-{k/2y:^, (28) 

where N is from the possible values for ii. A'' — 0{A''~^) is the number 
of A;-tuples of distinct differences xj E I a, and (fc/2)!^ is the number of 
relative arrangements of the positive and negative differences in the second 
block (each of which is matched with an opposite difference in the first 
block). 

Not all of the above will yield a +1 contribution to the 2A;-th moment. 
Remember, each index must be in {1, ... , A^}. We now show that for a 
large number of the above configurations, we do have all indices appropri- 
ately restricted. We call such a configuration valid. 

4.3. Number of Valid Configurations. Most of the time, the sum of the 

positive differences Xp in the first block will be close to the negative of the 
sum of the negative differences x„ in the first block. 

Explicitly, we may regard the XpS (x„s) as independent random variables 
taken from the uniform distribution on 1^ (—Ia) with mean approximately 
I A (—^A) and standard deviation approximately j^A. By the Central 

Limit Theorem, for k large, the sum of the | positive (negative) XpS (x„s) in 
the first block converges to a normal distribution with mean approximately 

^ (— ■^) and standard deviation approximately ■ 

Thus, for N and k sufficiently large, the probability that the sum of the 
positive differences in the first block is in — ^ + ^^^] is at least 
I (and a similar statement for the negatives). Thus, of the tuples, at least 
^A'' will have the sums of the positive (negative) differences lying in this 
interval (in the negative of this interval). We call such choices good. 

Remember, in the arguments leading up to Equation |28l we only speci- 
fied two items. First, the absolute values of the k differences (all distinct); 
second, that half the positive differences are in the first block, and the rel- 
ative orderings of the positive (negative) differences in the second block is 
given. 

Thus, we have freedom to choose how to intersperse the positives and 
negatives in the first and second blocks. Consider a good choice of x^s. 
We place these differences in the first block of length k as follows. Choose 
the first positive difference from our good list, and make the first difference 
positive. Keep assigning (in order) the positive differences from our good 
list until the running sum of the differences assigned to the first block ex- 
ceeds A. Then assign the negative differences from our good list until the 
running sum of differences in the first block is less than — A. We then assign 
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positive differences again until the running sum exceeds A, and so on. We 
assign half the positive (negative) differences to the first block. 

Throughout the process, the largest the running sum can be in absolute 
value is max(2A, 2 • |^). This is because the | positive (negative) dif- 
ferences yield sums whose negatives are very close to each other, and each 
added difference can change the running sum by at most ±A. 

We now assign the differences in the second block. We have already 
chosen the positive and negative differences. There are (|)! orderings of the 
positive (negative) differences. We choose these relative orderings, and now 
choose how to intersperse these. We put down the differences, again making 
sure the running sum never exceeds in absolute value max{2A, 2 ■ |^)- 

Let ii = 0. From Equation |27j we now see that each index is at most 
2max(2A,2--^). Therefore, each index is in ^^^] . Thus, 

if we shift ii so that ii E 



' 7 N N 
8 fc"-^ ' fc"-2 



as a > I for k large all indices 

will now be in {1, ... , A^}. Thus, this is a valid assignment of indices. 
We now count the number of valid assignments. We see this is at least 

To calculate the contribution to the 2A;-th moment from this pairing, we 
divide by N''^^. If any of the differences are the same, there is a slight 
complication; however, as is large relative to k, we may remove the small 
number of cases (at most (2) A'^) when we have repeat differences among 
the XpS and x„s. By Stirling's Formula, the main term is 



(30) 

Thus, the 2A;-th root looks like ^i+iogT > 0(A;^^"), proving the support 
is unbounded. 

5. Weak Convergence 
We need to show that the variances tend to 0. Thus, we must show 

lim (E[MUA,Ny]~E[MUA,N)]^) = 0. (31) 
As M^(A, A^) = ^^4+TTrace(A"), we have 
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E[M„(AiV)'] 



1 



X] XI ^hh-i2\- ■ ■b\im-il\b\jl-j2\- • ■b\jm-jl\\ 



]\[m+2 



l<il,...,i,n<N l<jl,...,jm<N 



E[Mm{A,N)]^ 



1 



]ym+2 




(32) 



There are two possibilities: if the absolute values of the differences from 
the is are completely disjoint from those of the js, then these contribute 
equally to E[M„(A, A^)^] and E[M^(A, A^)]^. We are left with estimating 
the difference for the crossover cases, when the value of an — ia+i — 

±(j/3 - J/3+1 )• 

We assume m — 2k; a similar proof works for odd m. Note N^'^'^ — 
^2fc+2 -pjjg following two lemmas imply the variance tends to 0. As our 
moments grow slower than the Gaussian, we satisfy the conditions neces- 
sary to obtain almost surely weak convergence. 

Lemma 5.1. The contribution from crossovers in E[M2k{A, A^)]^ is Ok{jf)- 

Proof. For 'E[M2k{A, N)], the expected value vanishes if anything is un- 
paired. Thus, in E,[M2kiA, N)]"^, in the is and js everything is at least 
paired, and there is at least one common value from a crossover. The maxi- 
mum number of such possibilities occurs when everything is paired on each 
side, and just one set of pairs crosses over; for this crossover there are 2 
ways to choose sign. In this case, there are + 1 degrees of freedom in the 
is, and k + l~l degrees of freedom in the js (we lost one degree of freedom 
from the crossover). Thus, these terms give 0(A^^^+^). Considering now 
matchings on each side with triple or higher pairings, more crossovers, and 
the two possible assignments of sign to the crossovers, we find that is and 
7S with a crossover contribute Ofe(^) to E[M^(A, N)]'^. □ 

Lemma 5.2. The contribution from crossovers in E[M^(A, N^] is Ok{j^)- 

Proof. If neither the i differences nor the j differences have anything un- 
paired (ie, everything is either paired or higher), and there is at least one 
crossover, it is easy to see these terms are Ok{jj). The difficulty occurs 
when we have unmatched singletons on either side. Assume there are un- 
matched differences among the is. We only increase the number of degrees 
of freedom by replacing triple pairings and higher among the is with pairs 
and singletons (note we may lose these degrees of freedom as these must be 
crossed and matched with the js, but we can always cross these over to the 
js with no net loss of degrees of freedom). Similarly, we can remove triple 
and higher pairings among the js. 
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Assume there are Sj > singletons and A; — f - pairs on the i side, Sj > 
singletons on the j side, and C > max(sj, Sj) crossings. Note Sj can equal 
0, if we send the singletons on the i side to matched pairs among the js, but 
C cannot be less than Sj and sj. Note s,, Sj are even. 

On the i side, there are 1 + (/c — + (sj — 1) degrees of freedom; the 
1 is from the freedom of assigning any value to one index, then we have 
k — Y from pairs, and then the last singleton's value is determined, so we 
have just Sj — 1 additional degrees of freedom from singletons. 

Assumesj > 0. Onthej side, there could have been 1 + (A; — Y) + (sj — 1) 
degrees of freedom, but we know we have C crossings. This loses at least 
C — 1 degrees of freedom (it's possible the last, forced j difference already 
equalled an i difference). Thus, the number of degrees of freedom is 



1+(A:-|) + (s,-1)] + [i+(A;-|)+(s,-1)-(C-1)] = 2k+l-^{2C-s~Sj). 

(33) 

If Sj = 0, then there are 1 + k — C degrees of freedom on the j side, and 
we get 2/c + 1 — (C — ^) degrees of freedom. 

Thus, there are at most 2k + 1 degrees of freedom. Doing the combi- 
natorics for choices of sign and number of triples and higher shows these 
terms contribute Okij^)- □ 

Theorem 5.3. The measures fiA,N{x) weakly converge to a universal mea- 
sure of unbounded support, independent of p. 

Proof. As Mk is less than the Gaussian's moments, the M^s uniquely de- 
termine a probability measure, which by SectionlUhas unbounded support. 
As E[Mfc(y4, A^)] Mk and the variances tend to zero, standard arguments 
give weak convergence. □ 

6. Almost Sure Convergence 

6. 1 . Expansions. For convenience in presentation, we assume p{x) is even 
(ie, the odd moments vanish); we remark on the trivial modifications to 
handle the additional book-keeping from general p{x). We will show 

lim E [|M„(A,iV) -E[M„(AiV)]|^] = O ( ^] . (34) 

N^oo yiV y 

The above (plus Chebychev and Borel-Cantelli) will yield almost sure 
convergence. Expanding this out, it is sufficient to study 



E[MUA, NY] - 4E[M^(A, Ar)3]E[M^(A, N)] + 6E[M^(A, Nf]E[M^{A, N)]^ 
-3E[M^(A, N)]E[MUA, N)]\ (35) 
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For even moments, we may write the pieces as 

i j k I 
i j k I 

(36) 

(note we combined the (3) and (^) terms) and so on, where for instance 

El = E[bisbjsbksbis] = IE[6|j^_j2| • • • b\j^^^j^\b\ki-k2\ • ■ • b\k2m-ki\b\h-i2\ • • • b\i2^-h\\ 

(37) 

We fix some notation. Denote the expected value sums above by Ei, E2, E^ 
and Ei (which occur with factors of 1, —4,6 and —3 respectively). For 
h e k, I}, let bh refer to the differences in b\hi-h2\ ■ ■ ■ b\h2m-hi\ If a 
difference in a bh is matched with another difference in b^, we say this is 
an internal matching; otherwise, it is an external matching. By a singleton, 
pair, triple, quadruple and so on, we refer to matchings within a bh (ie, an in- 
ternal matching). Thus, a triple occurs when exactly three of the differences 
in a bh are equal. 

Let Pa denote the a-th moment of p{x). Note p2 = 1- In X] ^^^ibjbkbi], if 
we have all differences occurring twice, except for two different differences 
occurring four times (two quadruples) and another different one occurring 
six times (one sextuple), we would have l^"*~^p|p6- 

Note there are at most Am + 4 degrees of freedom - everything must be 
matched in at least pairs (we have 8m total differences, as we are looking 
at the fourth power of the 2m-th moment), and then each bh has at most one 
more degree of freedom (can choose any index). Thus, any terms with a 
loss of at least two degrees of freedom contribute at most 0{^). 

6.2. Only Pairs and Singletons. We show there is no net contribution if 
there are no triples or higher, and then deal with that case afterwards. 

Lemma 6.1. Assume in addition there are no singletons. Then the contri- 
bution is 0{^). 

Proof. If there are no matchings between 6/jS, then everything is indepen- 
dent, and we get 1 — 4 + 6 — 3 = 0. If two pairs are matched, we lose one 
degree of freedom. There are (2) = 6 ways to choose two out of i,j,k, I to 
share a match. 

For the four expected value sums, we get the following contributions: 
(2)^4 from El, (2)^4 + (6 — (2)) from E2 (three times the two pairs are 
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in the expected value of a product together, giving ^4; the other three times 
they are separated, giving p2 — 1); (2)^4 + (6 — (2)) from £'3 (only once 
are the matched pairs together); (2) from £'4. Combining yields 

1 ■ 6j94 - 4(3p4 + 3) + 6(p4 + 5) - 3(6) = 0. (38) 
If at least three pairs are matched together, or two sets of two pairs are 
matched together, we lose at least 2 degrees of freedom, giving a contribu- 
tion of size 0{^). □ 

The following lemmas are the cornerstone of the later combinatorics: 

Lemma 6.2. If there is a singleton in bh paired with something in bg, then 
there is a loss of at least one degree of freedom. 

Note if every difference in a bh (all singletons) is paired with a differ- 
ence in bg (all singletons), we have a loss of one degree of freedom. We 
can choose any index and 2m — 1 differences in b^, the last difference is 
now determined. Once we choose one index in bg, all other indices are de- 
termined, for a total of 1 + (2m — 1) + 1 (instead of 2m + 2) degrees of 
freedom. Thus, instead of being able to choose 2m differences freely, we 
could only choose 2m — 1. 

Note the above argument holds if instead of all singletons, we have ele- 
ments of bg and bh only matched internally and externally with each other. 

Proof As we can cycle the labels, we may assume that b\h2m-hi\ is the sin- 
gleton. Note that once any index and the values of the other differences are 
given, then |/i2m — ^i| is determined. We would like to conclude it is not 

free, and we have lost a degree of freedom. 

Its value is forced, and it must equal the difference from another bg (h 7^ 
g e {i,j, k, I}), say b\g^-g^^^\. If b\g^-g^^,\ wasn't forced, we have just lost 
a degree of freedom; if it was forced, then we have already lost a degree of 
freedom. □ 

Remark 6.3. In the above, we did not need the matching to be with a sin- 
gleton - a pair, triple or higher would also have worked. 

Lemma 6.4. If at least three of the bhS have a singleton, there is a loss of 
at least two degrees of freedom. 

Proof. If there is a matching of singletons from say bi and bj, and another 
matching from bk and bi, the lemma is clear from above. Without loss of 
generality, the remaining case is when a singleton from bi is matched with 
one from bj, and another singleton from bi is matched with one from bi. We 
then apply the previous lemma to {bj, bi) and {bk,bi). □ 

We can now prove 
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Theorem 6.5. The contribution when there are no triple or higher internal 
pairings is at most 0{^). 

Proof. It is sufficient to show the non-zero contributions all lost at least two 
degrees of freedom. We have already handled the case when there are no 
singletons. If three or four bhS have a singleton, we are done by Lemma 
16.41 If exactly two have singletons, then there is no contribution in the Ei 
through i?4, except for the cases when they are under the expected value 
together (remember the mean of p vanishes). 

We have already lost a degree of freedom in this case; if any pair in any 
bh is matched with a pair in a bg, we lose another degree of freedom. Thus, 
we may assume there are no matches with four or more elements. Thus, 
every difference that occurs, occurs exactly twice. 

There are (2) = 6 ways to choose which two of the four bhS have single- 
tons paired. The contribution from Ei is 6, from E2 is 3 (3 of the 6 times 
they are under the expected value together; the other 3 times they are sepa- 
rated, and the expected value of a difference occurring once is 0), from E3 
is 1 (only 1 of the 6 ways have them under the expected value together), and 
from ^4 is 0. Thus, we have a contribution of 

1-6-4-3 + 6-1-3-0 = 0. (39) 

We are left with the case when the only singletons are in one bh. As we 
are assuming there are no triple or higher internal matchings, these single- 
tons must then be matched with pairs, giving external triples; as the odd 
moments of p{x) vanish, there is no net contribution. □ 

Remark 6.6. If we do not assume the odd moments ofp vanish, additional 
book-keeping yields the contribution is of size If exactly two of the b^s 
have singletons, then each has at least two; we 've already handled the case 
when they are matched together As no difference can be left unmatched, 
we just need to study the case when we get four triples or two triples and a 
pair; each clearly loses two degrees of freedom; 

We are left with the case when only one bh has singletons. We are down 
one degree of freedom already, so there cannot be another non-forced match- 
ing. If there are at least four singletons, we are done. If there are two sin- 
gletons, we get two triples (either with the same or different bgS). Similar 
arguments as before yield the contributions are 

l-6p^-4-3p^ + 6-p^-3-0 = (40) 
if the two external triples involve matchings from bh to the same bg, and 



l-4p^-4-3p^ + 6-0-3-0 = 0. 



(41) 



20 



CHRISTOPHER HAMMOND, STEVEN J. MILLER 



6.3. Eliminating Triple and Higher Matchings. 

Lemma 6.7. If there are no crossovers, there is no net contribution. 

Proof. If there are no crossovers, the expected value of the products are the 

products of the expected values. Thus, each term becomes E[M2m(^, 

and 1 - 4 + 6 - 3 = 0. □ 

Lemma 6.8. If there are at least two triples among all of the b^s, the con- 
tribution is O {^). 

Proof. Everything must be matched in at least pairs (or its expected value 
vanishes). If there are only two values among six differences, then instead 
of getting 3 degrees of freedom, we get 2. This is enough to see decay like 
O (^-^) . If we didn't assume p{x) were even, we would have more work; as 
the odd moments vanish, however, the two triples must be paired with other 
differences, or with each other. In either case, we lose at least one degree of 
freedom from each, completing the proof. □ 

Remark 6.9. Similarly, one can show there cannot be a triple and anything 
higher than a triple. Further, we cannot have two quadruples or more, as 
a quadruple or more loses one degree of freedom ( a quadruple is two pairs 
that are equal - instead of having two degrees of freedom, we now have 
one). 

Lemma 6.10. If there is a quadruple, quintuple, or higher matchings within 
a bh, the contribution is 0{^). 

Proof. There can be no sextuple or higher, as this gives at least three pairs 
matched, yielding one degree of freedom (instead of three). If there is a 
quadruple or quintuple, everything else must be pairs or singletons. As the 
odd moments vanish, a quintuple must be matched with at least a singleton, 
again giving six points matched, but only one degree of freedom. 

We are left with one quadruple (which gives a loss of one degree of free- 
dom) and all else pairs and singletons. No pairs can be matched to the 
quadruple or each other, as we would then lose at least two degrees of free- 
dom. If there are any singletons, by Lemma 1^!^ there is a loss of a degree 
of freedom. If we have a quintuple or higher, this is enough to lose two 
degrees of freedom. Thus, we need only study the case of all pairs and one 
quadruple, with no external matchings. 

As everything is independent, we find a contribution of 



1 ■ P4 - 4 ■ P4 + 6 ■ P4 - 3p4 = 0, 

where p4 is the fourth moment of p. 



(42) 

□ 
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Lemma 6.11. If there is only one triple (say in bh), the contribution is 



Proof. As odd moments vanish, the triple must be paired with a singleton 
from another bh', further, there must be at least one singleton in the same bh 
as the triple (as there are an even number of terms). We thus lose a degree 
of freedom from the triple matched with a singleton (four points, but one 
instead of two matches), and we lose a degree of freedom from the singleton 
in the same bh as the triple (Lemma l^I^ . Thus, we have lost two degrees of 
freedom. □ 

We have proved 

Theorem 6.12. The contribution from having a triple or higher internal 
matching is 0{-^). 

Remark 6.13. Similar arguments work for general p{x). 

6.4. Odd Moments. As the odd moments of p{x) vanish, handling 



is significantly easier. 
Theorem 6.14. We lose at least two degrees of freedom above, implying the 



Proof. In each bh, there is at least one odd internal matching (or singleton); 
thus, only Ei can be non-zero. If there are four (or more) internal triples (or 
higher), we lose at least two degrees of freedom. 

If there are exactly three internal triples, either two are matched together 
and one is matched with a singleton, or all three are matched with single- 
tons; in both cases we lose at least two degrees. 

If there are exactly two internal triples, there must be at least two fe^s with 
singletons. If the triples are matched with singletons, we lose two degrees; 
if the triples are matched together we lose one degree from that, and one 
more degree from the singletons (Lemma 1^^ . 

If there is exactly one triple, at least three bhS have singletons, and similar 
arguments yield a loss of at least two degrees. 

If there are no triples, then by Lemma I01 there is a loss of at least two 
degrees. □ 

Combining Theorems 16.121 and 16 . 1 41 vields 
Theorem 6.15. 





(43) 



expected value is O {^). 




(44) 
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Remark 6.16. Similar arguments work for general p{x). 

6.5. Almost Sure Convergence. We show that we have almost sure con- 
vergence. We first introduce some notation, and then show how this follows 
from Theorem 16. 151 

Fixp(a;) as before. Let Q^- be the outcome space (T^r, YliLi^ Pi^i)d-^i)^ 
where Tjy is the space of all x Real Symmetric Toeplitz matrices. Let 

be the outcome space (T^, Y[p)^ where is the set of all N x N Real 
Symmetric Toeplitz matrices and the product measure built from 

having the entries iidrv from p(x). For each A^, we have projection maps 
from to il^. Thus, if A e Tpj is a Real Symmetric Toeplitz matrices, then 
Ajv is the restriction obtained by looking at the upper left N x N block of 
A. 

We slightly adjust some notation from before. Let haj^ {x)dx be the prob- 
ability measure associated to the Toeplitz N x N matrix Ajy. Then 

fiAr,{x)dx = -^^(5 fx - ^'^ ) 
i=l ^ V / 

Mm{AN) = / x"'fXA^{x)dx 
JR 

Mm{N) = E[Mm{AN)] 

Mm = lim Mm{N). (45) 

As ^ oo, Mm{N) converges to Mm, and the convergence for each 
m is at the rate of The expectation above is with respect to the product 
measure on T/v built from p{x). 

We want to show that, for all m, as A^ — > oo, 

MmiA^) — > Mra almost suTcly. (46) 
By the triangle inequality, 

\MmiAN)-Mra\ < | (v4,v) - M„ ( A^) | + | M„ ( A^) - M„ | . (47) 

As the second term tends to zero, it is sufficient to show the first tends to 
zero for almost all A. 

Chebychev's Inequality states that for any random variable X with mean 
zero and finite m-th moment that 

Prob(|X| > e) < ^— i. (48) 
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Note E[M^(Ajv) - M^{N)] = 0, and by Theorem OSl M^{An) - 
Mm{N) has finite fourth moment. In fact, Chebychev's Inequality and The- 
orem |6lT5l yield 

Prob(|M„(AAr) - M^(A^)| > e) < < ——. 

(49) 

The proof is completed by applying the following: 

Lemma 6.17 (Borel-Cantelli). Let Bi be a sequence of events with Y^- Prob{Bi) < 
oo. Let 

{oo oo 
uj:u;ef][jBA. (50) 
j=i k=j J 

Then the probability of B is zero. 

In other words, an is in i? if and only if that to is in infinitely many B^, 
and the probability of events u which occur infinitely often is zero. 
Fix a large k and let 

= {AeT^: \MUAn) - MUN)\ > (51) 

We have seen that Prob(5jJ''"^) < Thus, for fixed m and k, the 

conditions of the Borel-Cantelli Lemma are met, and we deduce that the 
probability of A G T/v that occur in infinitely many B^j^'"^^ is zero. We now 
let k ^ oo, and find for any fixed m, as — > oo, Mm{AN) — > Mm with 
probability one. Let 5^° be the probability zero sets where we do not have 
such convergence. 

Let _B* °- = |Jm=i ^m'- ^ countable union of probability zero sets has 
probability zero, we see that Prob(i?* ° ) = 0; however, this is precisely the 
set where for some m, we do not have pointwise convergence. 

Thus, except for a set of probability zero, we find MmiA^) — > for 
all m. 

7. POISSONIAN BEHAVIOR? 

As there are only — 1 degrees of freedom for the Toeplitz Ensemble, 
and not 0{N'^), it is reasonable to believe the spacings between adjacent 
normalized eigenvalues may differ from those of full Real Symmetric Ma- 
trices. For example, band matrices of width 1 are just diagonal matrices, 
and there the spacing is Poissonian (e^^^); full Real Symmetric Matrices are 
conjectured to have their spacing given by the GOE distribution (which is 
well approximated by Axe~^^ ). 
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For (i-regular graphs, there are ^ degrees of freedom. It has been nu- 
merically observed (see IIJMRRI among others) that the spacings between 
adjacent eigenvalues look GOE. 

We chose 1000 Toeplitz matrices (1000 x 1000), with entries iidrv from 
the standard normal. We looked at the spacings between the middle 11 
normalized eigenvalues for each matrix, giving us 10 spacings. A plot of 
the spacings between normalized eigenvalues looks Poissonian. 

1 - 

\ 




We conjecture that in the limit as — > oo, the local spacings between 
adjacent normalized eigenvalues will be Poissonian. It is interesting to note 
that Random d-Regular Graphs have a comparable number of degrees of 
freedom; however, in their adjacency matrices, there is significantly more 
independence in the aij - for the Toeplitz Ensemble, we have a strict struc- 
ture, namely depends only on\i — j\. 
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